
The file used to conduct the statistical and regression analysis is called:

- "Main_Analysis_RESTAT.do" (Stata do file)


In the first part, this do file assembles all the data that have been constructed using the do files contained in this replication pack.
Some additional do files are constructed for robustness checks and extensions (e.g. by calculating the coagglomeration metric excluding London or the labour pooling measure over an extended number of years).
These additional datasets can be obtained by modifying the main do files. Information on the necessary changes is provided in the relevant do files and READ ME files.

Prior to conducting the regression analysis, the do file keeps, summarises and standardizes the relevant variables.
These variables are described in the do file since the data cannot be shared and the labels cannot be visualised.

The results are presented/produced in order/with labels similar to the ones in the tables of the submitted draft (e.g. Table 2 or Appendix Table 2)
The analysis also addresses some of the referees' concerns which are discussed in the paper but not necessarily tabulated.

Note also that in order to perform an instrumental variable analysis, the do file merges US data on labour pooling, input-output sharing and patents citations.
These were kindly provided by William Kerr. We cleaned the data for our purpose and made further correction to guarantee UK-USA industrial sectors lined up.

The main output of this do file is a log file called: 

-"MainAnalysis_Extended.log"